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Abstract 

A search for the standard model Higgs boson decaying into two Z bosons with sub- 
sequent decay into a final state containing two quark jets and two leptons, H 
2z(*) — > qqi^i^ is presented. Results are based on data corresponding to an inte- 
grated luminosity of 4.6 fb~^ of proton-proton collisions at ^/s = 7 TeV, collected with 
the CMS detector at the LHC. In order to discriminate between signal and background 
events, kinematic and topological quantities, including the angular spin correlations 
of the decay products, are employed. Events are further classified according to the 
probability of the jets to originate from quarks of light or heavy flavor or from glu- 
ons. No evidence for the Higgs boson is found, and upper limits on its production 
cross section are determined for a Higgs boson of mass between 130 and 600 GeV. 
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1 Introduction 

An important goal of experiments at the Large Hadron Collider (LHC) IH is to study the mech- 
anism of electroweak symmetry breaking through which the weak W and Z bosons acquire 
mass while the photon, 7, remains massless. Within the standard model (SM) |12H4| of particle 
physics it is postulated that the Higgs field provides the mechanism of electroweak symme- 
try breaking [5-lOJ. This model also predicts that the Higgs field would give rise to a spin- 
zero Higgs boson (H) with quantum numbers of the vacuum, J^*- = 0++. Limits set by the 
experiments at LEP (TT\ and the Tevatron [12J leave a wide range of allowed Higgs boson 
masses nta > 114.4 GeV and ntu ^ [162,166] GeV at 95% confidence level (CL). Recently, fur- 
ther limits were set by the ATLAS experiment IITSHTSlI at the LHC: mn ^ [145,206], [214,224], 
and [340,450] GeV. Indirect measurements |T6l suggest that the mass of a SM Higgs boson 
would most likely fall below 158 GeV at 95% CL. 

At the LHC, within the SM, Higgs bosons are primarily produced by gluon fusion (gg) fi7\ - 
|26| with an additional small contribution due to weak vector boson fusion (VBF) [27-321 and 
smaller contributions from other processes. The decay of a Higgs boson to two light fermions 
is highly suppressed Il33ti36| . Decay channels of the SM Higgs boson with two gauge bosons 
in the final state provide the greatest discovery potential at the LHC. For a Higgs boson mass 
m-j-i < Iniy^j those final states contain two photons or two weak bosons, ZZ* or WW*, where in 
each case one of the gauge bosons is off mass shell. For wh > 2mw/ the main final states are 
those with two on-mass-shell weak bosons: W+W^ for 2mw < w^h < 2mz, and additionally 
ZZ for ffjH > 2mz- 

In this Letter we present a search for a SM-like Higgs boson decaying via two Z bosons, one of 
which could be off mass shell, with a subsequent decay into two quark jets and two leptons, 
H — > ZZ(*) — > qqi'^£+. Constraints on the rate of the Higgs boson production and decay 
are presented as a function of mass and interpretations are given in two scenarios: SM and a 
model with four generations of fermions Il37l - l4l| . The branching fraction of this decay channel 
is about 20 times higher than that of H — >• ZZ^*) — )■ £^£^£^£^. Inclusion of this semileptonic fi- 
nal state in the search for the Higgs boson leads to improved sensitivity at higher masses, where 
kinematic requirements can effectively suppress background. In the low mass region with lep- 
tonically decaying off-mass-shell Z bosons, we can achieve effective background suppression 
by constraining the two jets to the known Z boson mass niz [42J. The search is performed with 
a sample of proton-proton collisions at a center-of-mass energy = 7 TeV corresponding to 
an integrated luminosity C = (4.6 ± 0.2) fb^^ recorded by the Compact Muon Solenoid (CMS) 
experiment l|43| at the LHC during 2011. 

2 Event Reconstruction 

We search for a fully reconstructed decay chain of the Higgs boson H —> ZZ^*^ —> qql^i^, 
see figure [l| where the charged leptons are either muons or electrons and the quarks are 
identified as jets in the CMS detector. The search is optimized separately for two ranges of 
the reconstructed mass, 125 < mzz < 170 GeV (low-mass) and 183 < mzz < 800 GeV (high- 
mass), corresponding to the H — > ZZ* and H — > ZZ analyses, respectively. The intermediate 
mass range between 2mw < < 2mz has reduced sensitivity because of the small branching 
fraction for H — > ZZ and is not included in the analysis. 

A detailed description of the CMS detector can be found in ref. ||43| . In the cylindrical co- 
ordinate system of CMS, (p is the azimuthal angle and the pseudorapidity (rj) is defined as 
Tj = — ln[tan(0/2)], where d is the polar angle with respect to the counterclockwise beam di- 
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2 Event Reconstruction 




Figure 1: Diagram describing the process pp — )• H + X — )• ZZ^*) + X — )■ qq£^£+ + X ir\ terms of 
the angles {9*, Oi, 6-[, 62, 0) defined in the parent particle rest frames (H or Z), where X indicates 
other products of the pp collision not shown on the diagram [,44] . 



rection. The central feature of the CMS detector is a 3.8 T superconducting solenoid of 6 m 
internal diameter. Within the field volume are the silicon tracker, the crystal electromagnetic 
calorimeter (ECAL), and the brass-scintillator hadron calorimeter (HCAL). The muon system 
is installed outside the solenoid and embedded in the steel return yoke. The CMS tracker 
consists of silicon pixel and silicon strip detector modules, covering the pseudorapidity range 
\f]\ < 2.5. The ECAL consists of lead tungstate crystals, which provide coverage for pseudo- 
rapidity \rj\ < 1.5 in the central barrel region and 1.5 < \ri\ < 3.0 in the two forward endcap 
regions. The HCAL consists of a set of sampling calorimeters which utilize alternating lay- 
ers of brass as absorber and plastic scintillator as active material. The muon system includes 
barrel drift tubes covering the pseudorapidity range \rj\ < 1.2, endcap cathode strip chambers 
(0.9 < \rj\ < 2.5), and resistive plate chambers (|?/| < 1.6). 

Although the main sources of background are estimated from data, Monte Carlo (MC) simu- 
lations are used to develop and validate the methods used in the analysis. Background sam- 
ples are generated using either MadGraph 4.4.12 ||45| (inclusive Z and top-quark production), 
ALPGEN 2.13 [46l (inclusive Z production), POWHEG | |4^j49l (top-quark production), or PYTHIA 
6.4.22 [501 (ZZ, WZ, WW, QCD production). Signal events are generated using POWHEG and 
a dedicated generator from ref. [441 . Parton distribution functions (PDF) are modeled using 
the parametrization CTEQ6 IISTI at leading order (LO) and CTIO ||52| at next-to-leading order 
(NLO). For both signal and background MC, events are simulated using a GEANT4 [53 J based 
model of the CMS detector and processed using the same reconstruction algorithms as used for 
data. 

Muons are measured with the tracker and the muon system. Electrons are detected as tracks in 
the tracker pointing to energy clusters in the ECAL. Both muons and electrons are required to 
have a momentum transverse to the pp beam direction, pj, greater than 20 GeV and 10 GeV, for 
the leading and subleading pj lepton, respectively. These requirements are tightened to 40 GeV 
and 20 GeV in the analysis of the H candidates at higher masses. Leptons are measured in the 
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pseudorapidity range \r]\ < 2.4 for muons, and \t]\ < 2.5 for electrons, although for electrons 
the transition range between the barrel and endcap, 1.44 < \ri\ < 1.57, is excluded. Both the 
Pt and t] requirements are consistent with those in the online trigger selection requiring two 
charged leptons, either electrons or muons. In the high-mass analysis, we also accept events 
selected with a single-muon trigger. The details of electron and muon identification criteria 
are described elsewhere |54|. Muons are required to be isolated from hadronic activity in the 
detector by restricting the sum of transverse momentum or energy in the tracker, ECAL, and 
HCAL, within a surrounding cone of AR = y(A;/)2 + (A(/))2 < 0.3, to be less than 15% of 
the measured pj of the muon, where Arj and Ai^ are the differences in pseudorapidity and in 
azimuthal angle measured from the trajectory of the muon. Electron isolation requirements 
are similar but vary depending on the shape of the electron shower. In both cases the energy 
associated with the lepton is excluded from the isolation sum. 

Jets are reconstructed with the particle-flow (PF) algorithm ||55| , which is an event reconstruc- 
tion technique with the aim of reconstructing all particles produced in a given collision event 
through the combination of information from all sub-detectors. Reconstructed particle candi- 
dates are clustered to form PF jets with the anti-A:^ algorithm [i56ii57i with the distance param- 
eter R = 0.5. The HCAL, ECAL, and tracker data are combined in the PF algorithm to measure 
jets. Jets that overlap with isolated leptons within AR = 0.5 are removed from consideration. 

Jets are required to be inside the tracker acceptance, thus allowing high reconstruction effi- 
ciency and precise energy measurements using PF algorithm. Jet-energy corrections are applied 
to account for the non-linear response of the calorimeters to the particle energies and other in- 
strumental effects. These corrections are based on in-situ measurements using dijet and 7 + jet 
data samples | |58| . Overlapping minimum bias events (pile-up) coming from different proton- 
proton collisions and the underlying event have an effect on jet reconstruction by contributing 
additional energy to the reconstructed jets. The median energy density resulting from pile-up 
is evaluated in each event, and the corresponding energy is subtracted from each jet | 59J . A jet 
requirement, primarily based on the energy balance between charged and neutral hadrons in a 
jet, is applied to remove misidentified jets. All jets are required to have pj > 30 GeV. 

Each pair of oppositely charged leptons and each pair of jets are considered as Z candidates. 
Background suppression is primarily based on the dilepton and dijet invariant masses, m(( and 
mjj. The requirement 75 < nijj < 105 GeV is applied in order to reduce the Z+jets background 
and 70 < niK < 110 GeV to reduce background without a Z in the final state, such as tt. 
Figure |2|(a) shows the dijet invariant mass nijj distribution for signal and background. In the 
search for the Higgs boson in the final state ZZ*, we require the invariant mass of the Z* — > 
i^£^ candidate to be less than 80 GeV instead of the previous requirement. Below threshold 
for on-shell production of ZZ, the signal cross section is much smaller but also the Z*/7*+jets 
background is strongly reduced. 

The statistical analysis is based on the invariant mass of the Higgs boson candidate, nizz, which 
is calculated using a fit of the final state four momenta and applying the constraint that the 
dijet invariant mass is consistent with the mass of the Z boson. The experimental resolutions 
are taken into account in this fit. 

Since the Higgs boson is spinless, the angular distribution of its decay products is independent 
of the production mechanism. Five angles {9* ,^1,61,62,^) defined in ref. Il44ll and in figure [l] 
fully describe the kinematics of the gg — )■ H — )■ ZZ^*) —> qqi^i^ process. Further kinematic 
selection exploits these five angular observables, which are only weakly correlated with the 
invariant masses of the H and the two Z bosons and with the longitudinal and transverse mo- 
menta of the Higgs boson candidate. The five angles along with the invariant masses provide 
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2 Event Reconstruction 



most of the discriminating power between signal and background. We construct an angu- 
lar likelihood discriminant (LD) based on the probability ratio of the signal and background 
hypotheses 'Psig/ (^sig + ^sig)/ as described in ref. [44J. The likelihood ratio is defined for each 
value of mzz arid its dependence on mzz is parameterized with smooth functions. Distributions 
of the angular LD for signal and background are shown in figure |2](b). The signal probability 
distribution is a correlated five-dimensional angular parameterization multiplied by empiri- 
cally determined polynomial acceptance functions from simulation that describe non-uniform 
reconstruction efficiencies in the detector. The background distribution is an empirical parame- 
terization taken as a product of independent distributions for each observable using simulation. 
Both are parameterized as functions of mzz- Cuts on the angular LD are chosen to optimize the 
expected sensitivity to the production of a SM Higgs boson and depend on mzz- The angular 
LD was found to have marginal separation power for mzz < 170 GeV and therefore is not used 
in selection requirements for this low-mass range. 

The parton type of the jets provides a powerful tool for background discrimination. In signal 
events, the jets originate from Z bosons decaying to quarks that subsequently hadronize. The 
flavor of quarks in Z decays is almost equally distributed among the five types d, u, s, c, h, 
with some preference given to the down-type quarks. The dominant background is a lepton- 
ically decaying Z boson produced in association with high-px jets, a process in which gluon 
radiation plays a major role. Beside gluons, the uand dquarks from the protons dominate the 
jet production associated with the Z. Therefore, the main features that discriminate signal from 
background are the relatively large contribution of heavy-flavor quarks (band s) and the ab- 
sence of gluons. We take advantage of both features in the analysis by tagging the b flavor and 
introducing a likelihood discriminant that separates gluon and light-quark jets on a statistical 
basis, as described below. 

To identify jets originating from the hadronization of bottom quarks, we use the CMS track 
counting high-efficiency (TCHE) b-tagging algorithm [60,. 61 1, which relies on tracks with large 
impact parameters. A jet is b-tagged if there are at least two tracks each with a three-dimensional 
impact-parameter significance larger than a given threshold which has been optimized. The 
distributions of the resulting b-tagging discriminant is shown in figure|2](c). The data are split 
into three b-tag categories: a 2 b-tag category is required to have one jet identified with medium 
(~65% efficiency) and the other jet with loose (~80% efficiency) TCHE requirements; events not 
selected in the 2 b-tag category are categorized as 1 b-tag if they have at least one jet satisfying 
the loose-tag requirements; the b-tag category contains all the remaining events. The com- 
position of the expected signal and background events varies significantly among the three 
categories, see figure |2](d). 

The b-tag category is dominated by the Z+jets background, and from these events we further 
select a "gluon-tagged" category, which is excluded from further analysis if the two leading 
jets are consistent with being initiated by gluons, based on three measured quantities. These 
are the number of charged hadronic particle tracks, the number of photon and neutral hadrons, 

and the variable PTD = ^J^/f^Jl^^/prj^ , where the sum is extended over all jet constituents. 
The variable PTD is related to the fragmentation variable z = px (constituent)/ px(jet) and is 
approximately equal to a/I^z^. Gluon hadronization favors the production of a larger number 
of stable particles. This translates into the observation of softer (low PTD), high-multiplicity jets 
when compared to those generated by final-state quarks. We construct a quark-gluon LD from 
the above three observables. The corresponding LD distributions for signal and background are 
shown in figure |2](e). The relative number of gluon- and quark-jets for the main background, 
Z+jets, is not well known and it is not expected to be well reproduced by the simulation. The 
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quark-gluon LD is instead verified using data samples of 7+jets enriched in quark-jets. 

In order to suppress the substantial tt background in the 2 b-tag category, we apply a selection 
on the missing transverse energy (E^'^®) which is defined as the modulus of the negative vector 
sum of all reconstructed PF particles in the event. We construct a discriminant. A, which is the 
ratio of the likelihoods of the hypothesis with £^'** equal to the value measured with the PF 
algorithm and the null hypothesis (£|?^'®® = 0) II62I . This discriminant provides a measure that 
the event contains genuine missing transverse energy. The distribution of 2 In A(E^^®^) is shown 
in figure |2](f). We apply a loose requirement, 21n A(E^^®^) < 10, in the 2 b-tag category only. 
In the low-mass analysis, we instead apply the selection requirement Ef^^'^^ < 50 GeV in the 2 
b-tag category. 

Data and MC predictions of background distributions after the preselection requirements are 
shown in figure |2j where the additional contribution of a Higgs boson signal would be indistin- 
guishable above the overwhelming background. The overall agreement between background 
simulation and data is good except for systematic differences related to the quark-gluon com- 
position in Z+jets events, as shown in figure |2](e). We do not rely directly on simulation for 
background estimates. Instead, the background is determined directly using sidebands in data 
(see Section |3]l. 

The main selection requirements are summarized in table [l] When an event contains multiple 
candidates passing the selection requirements, we retain the one with jets in the highest b-tag 
category for the analysis. Further ambiguity between multiple candidates is resolved selecting 
the candidate with nijj and nin values closest to the Z boson mass niz- The distribution of 
the nizz invariant mass for background and data are displayed for the three b-tag categories 
in figure |3] No significant deviation is observed between the data and the expectation for 
background. The main backgrounds include inclusive Z production with either light-flavor 
or heavy-flavor jets, top-quark production, and diboson production such as WZ and ZZ. The 
expected and observed event yields are listed in table |2] The expected background is quoted 
from the nijj sideband procedure described below and from simulation. In the low-mass range, 
the background distribution is obtained from the nijj sideband while its size is estimated from 
the mzz sideband chosen for each mn hypothesis, as discussed below. 



3 Event Analysis 

Data containing a Higgs boson signal would have a distinct resonance peak in addition to the 
continuum background distribution. The estimates from simulation shown in figure |3] provide 
a good illustration of the expected background but require further validation of both theoretical 
predictions, such as production cross section, and detector effects, e.g. b-tagging efficiency. 
These effects can explain the discrepancies between data and background simulation, which 
are sizable near the ZZ threshold around mzz = 200 GeV. However, the analysis technique 
relies on sidebands measured in data and is largely insensitive to the modeling of the mzz 
distributions. 

In order to minimize the systematic uncertainty from the background models, we estimate the 
background distribution from the nijj sidebands, defined as 60 < nijj < 75 GeV and 105 < 
nijj < 130 GeV. In simulation, the composition and distribution of the dominant backgrounds 
in the sidebands is similar to that in the signal region, 75 < nijj < 105 GeV. The expected num- 
ber of background events, Nbkg(wzzz), is obtained from the number of events in the sidebands. 
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Figure 2: Distribution of the dijet invariant mass nijj (a), angular likelihood discriminant (b), 
b-tagging discriminant (c), flavor tagging category (d), including the gluon-tagged category, 
quark-gluon likelihood discriminant (e), and 21n A(£^'®®) (f). Points with error bars show dis- 
tributions of data after preselection requirements defined in table [T] with an additional require- 
ment 70 < niff < 110 GeV. Solid histograms depict the background expectation from simulated 
events with the different components illustrated. Open histograms indicate the expected dis- 
tribution for a Higgs boson with a 400 GeV mass, multiplied by a factor of 100. 
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Table 1: Summary of kinematic and topological selection requirements. Numbers in parenthe- 
ses indicate additional selection requirements in the mzz range [125, 170] GeV, where angular 
and quark-gluon likelihood discriminant requirements are not used. 





preselection 




leading px > 40(20) GeV, subleading px > 20(10) GeV 


pxOets) 




> 30 GeV 








< 2.5 (e±), < 2.4 (^±) 




1 '/I (jets) 




<2.4 






final selection 




b-tag 


1 b-tag 


2 b-tag 


b-tag 


none 


one loose 


mediimi& loose 


angular LD 


> 0.55 + 0.00025 mzz 


> 0.302 -1-0.000656 mzz 


> 0.5 


quark-gluon LD 


> 0.10 


none 


none 


£miss requirements 


none 


none 


21nA(£^'ssj < 10 








(Ef^i^^ < 50 GeV) 


mjj 




G [75, 105] GeV 








G [70,110] (<80)GeV 




nizz 


G [183, 800] (G [125, 170]) GeV 



Table 2: Observed and expected event yields for 4.6 fb^ of data. The yields are quoted in the 
range 125 < mzz < 170 GeV or 183 < mzz < 800 GeV, depending on the Higgs boson mass 
hypothesis. The expected backgroimd is quoted from the myy sideband procedure and from 
simulation (MC). In the low-mass range, the background is estimated from the mzz sideband 
for each Higgs mass hypothesis and is not quoted in the table. The errors on the expected 
background from simulation include only statistical uncertainties. 





b-tag 


1 b-tag 


2 b-tag 


mzz G [125, 170] 


observed yield 
expected background (myy sideband) 
expected background (MC) 


1087 

1050 ± 54 
1089 ± 39 


360 

324 ± 28 
313 ± 20 


30 
19 ±5 
24±4 


mzz e [183, 800] 


observed yield 
expected background (myy sideband) 
expected background (MC) 


3036 
3041 ±54 

3105 ± 39 


3454 
3470 ± 59 
3420 ±41 


285 
258 ± 17 
255 ± 11 


signal expectation (MC) 


mH=150 GeV 
mH=250 GeV 
mH=350 GeV 
mH=450 GeV 
mH=550 GeV 


10.1 ± 1.5 

24.5 ± 3.5 

29.6 ± 4.3 
16.5 ± 2.4 
6.5 ± 1.0 


4.1 ± 0.6 

21.7 ± 3.0 
26.0 ± 3.7 

15.8 ± 2.2 
6.5 ± 0.9 


I. 6 ± 0.3 
8.1 ±1.7 

II. 8 ± 2.5 
7.9 ± 1.7 
3.6 ± 0.8 
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Figure 3: The mzz invariant mass distribution after final selection in three categories: b- 
tag (top), 1 b-tag (middle), and 2 b-tag (bottom). The low-mass range 120 < mzz < 170 GeV is 
shown on the left and the high-mass range 183 < mzz < 800 GeV is shown on the right. Points 
with error bars show distributions of data and solid curved lines show the prediction of back- 
ground from the sideband extrapolation procedure. In the low-mass range, the background 
is estimated from the mzz sideband for each Higgs mass hypothesis and the average expecta- 
tion is shown. Solid histograms depicting the background expectation from simulated events 
for the different components are shown. Also shown is the SM Higgs boson signal with the 
mass of 150 (400) GeV and cross section 5 (2) times that of the SM Higgs boson, which roughly 
corresponds to expected exclusion limits in each category. 
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Nsh{mzz), as follows: 

r ^ AT / ^ ^bLg("^zz) . ^ , ^ 

M,kg(mzz) = Nsb(mzz) X — = Nsb(mzz) x a(mzz), (1) 

^sb ("^zzj 

where a(mzz) is the ratio of the expected number of background events in the signal and side- 
band regions obtained from simulation. This factor corrects for acceptance differences between 
the two regions and also for differences in background composition. 

In the high-mass range, the distributions derived from data sidebands are measured for each of 
the three b-tag requirements and give the normalization of the background and its dependence 
on nizz- The correction a(mzz) reaches a maximum of about 1.2 near the threshold of Iniz 
and falls to nearly a constant value between 0.75 and 1.0 elsewhere, depending on b-tag and 
kinematic requirements. 

In the low-mass range, below the Iniz threshold, the same kinematic selections are applied to 
all b-tag categories and a single background spectrum is derived from the nijj sidebands. The 
correction oi{mzz) is not applied, and instead the normalizations in each category are obtained 
as a function of myi, using an mzz sideband outside the window niyi ± 5 GeV. 

The results of the sideband extrapolation procedures are shown as solid curves in figure |3] 
and are in good agreement with the observed distributions in data. In all cases, the dominant 
backgrounds include Z+jets with either light- or heavy-flavor jets and top background, both of 
which populate the nijj signal region and the nijj sidebands. The diboson background amounts 
to less than 5% of the total in the and 1 b-tag categories and about 10% in the 2 b-tag category. 
This diboson background is accounted for by a (mzz) in the high-mass range and by the mzz 
sideband procedure in the low-mass range. 

The distribution of mzz for the background is parameterized with an empirical function, fitted 
to the shape and normalization determined from the sidebands. The advantage of this ap- 
proach is that most of the systematic uncertainties on the background cancel. The dominant 
normalization uncertainty in the background estimation is due to statistical fluctuations of the 
number of events in the sidebands. The reconstructed signal distributions are described with 
a two-sided Crystal-Ball function Il63l and an empirical function reflecting misreconstruction 
of the Higgs boson decay products. The signal reconstruction efficiency and the mzz distribu- 
tion are parameterized as a function of mn and are extrapolated to all mass points. The main 
uncertainties in the signal mzz parameterization are due to resolution which is predominantly 
affected by the uncertainty on the jet energy scale ||58| . 

The mzz distributions of the selected events are split into six categories based on the b-tag type 
and the lepton flavor. These events are examined for 43 hypothetical Higgs boson masses in 
a range between 130 GeV and 164 GeV, and 73 hypothetical Higgs boson masses in the range 
between 200 GeV and 600 GeV, where the mass steps are optimized to account for the expected 
width, Fh, and resolution for measurement of mH UMI . For each mass hypothesis, we perform 
a simultaneous likelihood fit of the six mzz distributions using the statistical approaches dis- 
cussed in ref. [64|. As an alternative, we have also studied a cut-based analysis that counts 
events in regions of the mzz distribution and found consistent, but systematically higher me- 
dian expected limits compared to the likelihood fit approach. We adopt the modified frequen- 
tist construction CLg I|64l - l66| as the primary method for reporting limits. As a complementary 
method to the frequentist paradigm, we use the Bayesian approach [67] and find consistent 
results. 

The systematic uncertainties on signal normalization are summarized in table |3] We consider 
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3 Event Analysis 



Table 3: Summary of systematic uncertainties on signal normalization. Most sources give mul- 
tiplicative uncertainties on the cross-section measurement, except for the expected Higgs boson 
production cross section, which is relevant for the measurement of the ratio to the SM expecta- 
tion. The ranges indicate dependence on mn- 



source 


b-tag 


1 b-tag 


2 b-tag 


muon reconstruction 


2.7% 


electron reconstruction 


4.5% 


jet reconstruction 


1-8% 


pile-up 


3-4% 


gmiss 






3-4% 


b-tagging 


2-7% 


3-5% 


10-11% 


gluon-tagging 


4.6% 






acceptance (HqT) 


2% 


5% 


3% 


acceptance (PDF) 


3% 


acceptance (VBF) 


1% 


signal cross section (PDF) 


8-10% 


signal cross section (scale) 


8-11% 


signal shape 


1.5 X 10^''% X mf^ [GeV] 


luminosity 


4.5% 



effects from lepton energy scale, resolution, selection, and trigger (electron/ muon reconstruc- 
tion); jet resolution and efficiency (jet reconstruction); pile-up; £j?^'*® requirements; heavy-quark 
flavor tagging and quark-gluon discrimination; Higgs boson production mechanism; cross sec- 
tion and branching fractions; resonance mass shape; and LHC luminosity. Reconstruction effi- 
ciencies for leptons and their uncertainties are evaluated from data with a "tag-and-probe" |54| 
approach where one lepton from an inclusive sample of Z decays serves as a tag and the effi- 
ciency for the reconstruction of the other lepton is calculated. Contributions from jet reconstruc- 
tion are evaluated by variation of the jet energy and resolution within calibration uncertainties. 
The contributions from the uncertainty on pile-up are taken from the simulated difference be- 
tween the reconstruction and the selection efficiency with pile-up below and above the average 
expected value, distributed according to the measurement in data. The uncertainty of the Ef^^^^ 
selection efficiency is computed by examining the Ef^'^^ distributions from Z inclusive produc- 
tion in MC simulation and in data after subtraction of background from top production. Un- 
certainties due to b tagging have been evaluated with a sample of jet events enriched in heavy 
flavor by requiring a muon to be spatially close to a jet. The uncertainty on the quark-gluon 
LD selection efficiency was evaluated using the 7 + jet sample in data, which predominantly 
contains quark jets. 

Uncertainties in the production mechanism affect the signal acceptance in the detector. Both 
the longitudinal momentum of the Higgs boson, because of PDFs, and the transverse mo- 
mentum of the Higgs boson, because of QCD initial-state radiation effects, are model depen- 
dent. We rescale the transverse momentum distribution of the Higgs boson using the HqT l|68| 
code and take the full change in the efficiency as a systematic uncertainty. We follow the 
PDF4LHC 11521 169H72I recommendation to estimate the uncertainty due to PDF knowledge and 
to calculate the uncertainty on signal acceptance. Uncertainties on the production cross section 
for the Higgs boson are taken from ref. [73|, which includes uncertainties from the QCD renor- 
malization and factorization scales, PDFs, and ocg. These uncertainties are separated between 
the gg and VBF production mechanisms, but uncertainties on the gg process dominate in the 
total production cross section. We also account for a small uncertainty because of a difference in 
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signal acceptance with the gg and VBF production mechanisms, while the selection efficiency 
was optimized and evaluated for the dominant gg production. A relative uncertainty of 4.5% 
on luminosity is applied to the signal normalization. 

Recent studies f39\ O show that current Monte Carlo simulations do not describe the cor- 
rect Higgs boson mass line shape above ~ 300 GeV. These effects are estimated to lead to an 
additional uncertainty on the theoretical cross section of 10-30% for jwh of 400-600 GeV and are 
included in the calculations of the limits. 

We also consider the production and decay of the Higgs boson within a model with four 
generations of fermions (SM4) i37l - l4T| , including electroweak radiative corrections. The fol- 
lowing scenario has been adopted in the SM4 calculations: = 600 GeV and — m^/ = 
50(1 + 0.2 ln(mH/115)) GeV, following recommendation of ref. II73II . The main difference from 
the SM is a higher production rate and somewhat different branching fractions of the SM4 
Higgs boson. We assume that the main uncertainties on the SM4 Higgs production cross sec- 
tion are the same as the gluon-fusion mechanism in the SM [73 J. 

In order to infer the presence or absence of a signal in the data sample, we construct an ap- 
propriate test statistic cj, a single number encompassing information on the observed data, 
expected signal, expected background, and all uncertainties associated with these expecta- 
tions [64]. The definition of q makes use of a likelihood ratio for the signal+background model 
and the model with the best-fit signal strength plus background. We compare the observed 
value of the test statistic with its distributions expected under the background-only and sig- 
nal+background hypotheses. The expected distributions are obtained by generating pseudo- 
datasets. The signal strength which leads to a 95% CL limit is determined for each Higgs mass 
hypothesis under study. 

4 Results 

No evidence for the Higgs boson is found and exclusion limits at 95% CL on the ratio of the 
production cross section for the Higgs boson to the SM expectation are presented in figure |4] 
The observed limits are within expectation for the background-only model. The significance of 
the only local deviation beyond the 95% expectation range around 225 GeV is greatly reduced 
after taking into account the look-elsewhere effect {75\, for which the estimated trial factor 
is about 18 in the high-mass range. Results obtained with the Bayesian approach yield very 
similar limits to those from CLg. 

Limits on the SM production cross section times branching fraction for H — ?► ZZ are pre- 
sented in figure |5] For comparison, expectations are shown for both the SM and for the SM4 
model. The ranges 154—161 GeV and 200-470 GeV of SM4 Higgs mass hypotheses are excluded 
at 95% CL. The exclusion limits in figure |4] are also approaching the cross section for the SM 
expectation for production of the Higgs boson. 

5 Summary 

A search for the SM Higgs boson decaying into two Z bosons which subsequently decay to two 
quark jets and two leptons, H —> ZZ^*) —> qqi^i^, has been presented. Data corresponding 
to an integrated luminosity of 4.6 fb^^ of proton-proton collisions at centre-of-mass energy of 
7 TeV have been collected and analyzed by the CMS Collaboration at the LHC. No evidence for 
a SM-like Higgs boson has been found and upper limits on the production cross section for the 
SM Higgs boson have been set in the range of masses between 130 and 164 GeV, and between 
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125 130 135 140 145 150 155 160 165 170 200 250 300 350 400 450 500 550 600 



mn [GeV] m^^ [GeV] 

Figure 4: Observed (solid) and expected (dashed) 95% CL upper limit on the ratio of the pro- 
duction cross section to the SM expectation for the Higgs boson obtained using the CLg tech- 
nique. The 68% (1(7") and 95% (la) ranges of expectation for the background-only model are 
also shown with green (darker) and yellow (lighter) bands, respectively. The solid line at 1 
indicates the SM expectation. Left: low-mass range, right: high-mass range. 




Figure 5: Observed (dashed) and expected (solid) 95% CL upper limit on the product of the 
production cross section and branching fraction for H — > ZZ obtained with the CLg tech- 
nique. The 68% (Id) and 95% (Icr) ranges of expectation for the background-only model are 
also shown with green (darker) and yellow (lighter) bands, respectively. The expected product 
of the SM Higgs production cross section and the branching fraction is shown as a red solid 
curve with a band indicating theoretical uncertainties at 68% CL. The same expectation in the 
fourth-generation model is shown with a red dashed curve with a band indicating theoretical 
uncertainties. Left: low-mass range, right: high-mass range. 
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200 and 600 GeV. In this analysis we have also excluded at 95% CL a large range of Higgs boson 
mass hypotheses in the model with a fourth generation of fermions having SM-like couplings. 
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